Forward and Backward Speech Skimming with the Elastic Audio Slider
نویسندگان
چکیده
In pursuit of the goal to make recorded speech as easy to skim as printed text, a variety of methods and user interfaces have been suggested in the literature, involving time-compressed audio, speech segmentation and recognition, etc. We propose a new user interface, the elastic audio slider, which makes navigation in speech documents similar to video navigation or text scrolling. The approach supports navigation at variable speed in both forward and backward direction while providing immediate intelligible audio feedback during the user’s interactions. A user study was conducted to prove the usefulness of backward replay of speech for tasks such as topic classification. In addition, we show that the proposed interface offers the opportunity to combine the advantages of existing approaches within a single, easy-to-use UI component that complements and enhances the common user interfaces known from standard audio player software.
منابع مشابه
Interactive Manipulation of Replay Speed
Today’s interfaces for time-scaled audio replay have limitations especially regarding highly interactive tasks such as skimming and searching, which require quick temporary speed changes. Motivated by this shortcoming, we introduce a new interaction technique for speech skimming based on the so called rubberband metaphor. We propose an “elastic” audio slider which is especially useful for tempo...
متن کاملImplementing the New First and Second Differentiation of a General Yield Surface in Explicit and Implicit Rate-Independent Plasticity
In the current research with novel first and second differentiations of a yield function, Euler forward along with Euler backward with its consistent elastic-plastic modulus are newly implemented in finite element program in rate-independent plasticity. An elastic-plastic internally pressurized thick walled cylinder is analyzed with four famous criteria including both pressure dependent and ind...
متن کاملNew Touch Screen Application
An adaptive speech rate control technology for ultra fast listening that is equivalent to skimming is described. Nowadays, listening to audio books on mobile devices is quite common. People read books at various levels of detail from close reading to skimming. Although a similar feature to skimming is required to efficiently obtain information from audio sources, there is no tool equivalent to ...
متن کاملA Turbo-Decoding Weighted Forward-Backward Algorithm for Multimodal Speech Recognition
Since the performance of automatic speech recognition (ASR) still degrades under adverse acoustic conditions, recognition robustness can be improved by incorporating further modalities. The arising question of information fusion shows interesting parallels to problems in digital communications, where the turbo principle revolutionized reliable communication. In this paper, we examine whether th...
متن کاملTurbo Decoders for Audio-Visual Continuous Speech Recognition
Visual speech, i.e., video recordings of speakers’ mouths, plays an important role in improving the robustness properties of automatic speech recognition (ASR) against noise. Optimal fusion of audio and video modalities is still one of the major challenges that attracts significant interest in the realm of audiovisual ASR. Recently, turbo decoders (TDs) have been successful in addressing the au...
متن کامل